Improving spoken document retrieval by unsupervised language model adaptation using utterance-based web search

Authors

  • Robert Herms
  • Marc Ritter
  • Thomas Wilhelm-Stein
  • Maximilian Eibl
Abstract

Information retrieval systems facilitate the search for annotated audiovisual documents from different corpora. One of the main problems is determining domain-specific vocabulary such as names, brands, and technical terms when using general language models (LMs), especially in broadcast news. Our approach consists of two steps that address the out-of-vocabulary (OOV) problem in order to improve spoken document retrieval performance. First, we separate the resulting transcript of a speech recognizer into blocks.

Similar resources

A Spoken Document Retrieval Application in the Oral History Domain

The application of automatic speech recognition in the broadcast news domain is well studied. Recognition performance is generally high and accordingly, spoken document retrieval can successfully be applied in this domain, as demonstrated by a number of commercial systems. In other domains, a similar recognition performance is hard to obtain, or even far out of reach, for example due to lack of...

Unsupervised Language Model Adaptation using Utterance-based Web Search for Clinical Speech Recognition

In this working notes paper, we present our methodology in clinical speech recognition for Task 1.a.1 of the CLEF eHealth Evaluation Lab 2015. The goal of this task is to minimize word detection errors. Our approach is based on the assumption that each spoken clinical document has its own context. Hence, the recognition system is adapted for each document separately. The proposed method p...

Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task

This paper presents speech-driven Web retrieval models which accept spoken search topics (queries) in the NTCIR-3 Web retrieval task. The major focus of this paper is on improving speech recognition accuracy of spoken queries and then improving retrieval accuracy in speech-driven Web retrieval. We experimentally evaluated the techniques of combining outputs of multiple LVCSR models in recognition...

PodCastle: Collaborative Training of Language Models on the Basis of Wisdom of Crowds

This paper presents a language-model training method for improving automatic transcription of online spoken contents. Unlike previously studied LVCSR tasks such as broadcast news and lectures, large-sized task-specific corpora for training language models cannot be prepared and used in recognition because of the diversity of topics, vocabularies, and speaking styles. To overcome difficulties in...

Dynamic language model adaptation using keyword category classification

This paper describes a language model adaptation method for improving speech recognition of keywords in spoken queries occurring in information retrieval tasks. The method dynamically adapts language models to keyword categories within a single utterance; it first estimates keyword categories and their positions in an input query utterance and then dynamically changes the weights for language m...

Publication date: 2014